CorGen—measuring and generating long-range correlations for DNA sequence analysis
نویسندگان
چکیده
CorGen is a web server that measures long-range correlations in the base composition of DNA and generates random sequences with the same correlation parameters. Long-range correlations are characterized by a power-law decay of the auto correlation function of the GC-content. The widespread presence of such correlations in eukaryotic genomes calls for their incorporation into accurate null models of eukaryotic DNA in computational biology. For example, the score statistics of sequence alignment and the performance of motif finding algorithms are significantly affected by the presence of genomic long-range correlations. We use an expansion-randomization dynamics to efficiently generate the correlated random sequences. The server is available at http://corgen.molgen.mpg.de.
منابع مشابه
Generating Non-trivial Long-Range Correlations and 1/f Spectra by Replication and Mutation
This paper aims at understanding the statistical features of nucleic acid sequences from the knowledge of the dynamical process that produces them. Two studies are carried out: rst, mutual information function of the limiting sequences generated by simple sequence manipulation dynamics with replications and mutations are calculated numerically (sometimes analytically). It is shown that elongati...
متن کاملDNA Sequence Fragment Containing C to A Mutation as a Convenient Mutation Standard for DHPLC Analysis
Objective(s): Denaturing high performance liquid chromatography (DHPLC) is a high throughput approach for screening DNA sequence variations. To assess oven calibration, cartridge performance, buffer composition and stability, the WAVE Low and High Range Mutation Standards are employed to ensure reproducibility and accuracy of the chromatographic analysis. The purpose of this study was to provi...
متن کاملNovel Method for Generating Long-Range Correlations
A b s t r a c t We propose an algorithm to generate a sequence of numbers with long-range power-law correlations which is well-suited for large systems. Starting with a set of random uncorrelated variables, we modify its Fourier transform to get a new sequence with longrange correlations. By mapping the variables to a one dimensional random walk problem we find analytical and numerical evidence...
متن کاملMosaic organization of DNA nucleotides.
Long-range power-law correlations have been reported recently for DNA sequences containing noncoding regions. We address the question of whether such correlations may be a trivial consequence of the known mosaic structure ("patchiness") of DNA. We analyze two classes of controls consisting of patchy nucleotide sequences generated by different algorithms--one without and one with long-range po...
متن کاملThe Lack of Long Range Correlations is a Necessary Condition for a Functional Biologically Active Protein
We study random heteropolymer chain with gaussian distribution of kinds of monomers. The long-range correlations between kinds of monomers were introduce. The mean-field analysis of such heteropolymer indicates the existence of infinite energetic barrier between heteropolymer random coil and frozen states. Thus, the frozen state is kinetically unavailable for the random heteropolymer with power...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Nucleic Acids Research
دوره 34 شماره
صفحات -
تاریخ انتشار 2006